# Multilingual Visual Reasoning

InternVL3 38B Instruct GGUF

InternVL3-38B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.

Llama 4 Maverick 17B 128E Instruct

Llama 4 Maverick is a multimodal Mixture of Experts (MoE) model from Meta with 17 billion active parameters and 128 experts. It supports 12 languages and image understanding, and is suitable for commercial and research applications.

Multimodal Fusion

Transformers, Supports Multiple Languages

Trillion LLaVA 7B FP16

Trillion-LLaVA-7B is a vision-language model with image understanding capabilities. It was trained on English vision-language instruction pairs and demonstrates exceptional cross-lingual visual reasoning abilities.

Transformers, Supports Multiple Languages

InternVL3 1B AWQ

InternVL3-1B is a multimodal large language model in the InternVL3 series, featuring exceptional multimodal perception and reasoning capabilities.

Transformers, Other

InternVL3-1B is a 1B-parameter multimodal large language model in the InternVL3 series. It integrates the InternViT visual encoder with the Qwen2.5 language model and offers exceptional multimodal perception and reasoning capabilities.

Transformers, Other

© 2025 AIbase